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(57) Abstract 

An immunogenic detoxified protein 
comprising the amino add sequence of subunxt 
A of cholera toxin (CT-A) or snbunit A or an 
Escherichia colt beat labile toxin (LT-A) or a 
fragment thereof wherein one or more amino 
adds at, or in positions corresponding to Val- 
S3, Ser-63, Val-97. Tyr-104 or Pro-106 are re- 
placed with another amnio acid or deleted. Ex- 
amples of specific replacements include Val- 
53-Asft Val43-GJhi, Val.53.TJr, Ser-63-Lys, 
Val47~Lys, VaW7-T>r, Tyr-lW-Ije, Tyr- 
104- Asp, TVr-104-Ser, Pro-106-Ser. The immu- 
nogenic detoxified protein is useful as vaccine 
tor Vibrio cholerae or an enterotoxigenic strain 
of Escherichia eoU and is produced by feoanv 
hfjumt TWA means by jlte-directed mutaneno- 
sis. 
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Immunogenic detoxified mutants of cholera toxin and of the 

tffiSiU UL* ttmr preparation and their use for the 

preparation of vaccines 

5 rielfl ffg the Invention 

The present invention relates to immunogenic detoxified 
proteins of cholera toxins (CT) , or of heat labile toxins 
(LT) produced by the enterotoxigenic strains of -Escherichia 

10 coli (E.coli) having substitutions at one or more of amino 
acids Val-53, Ser-63, Val-97, Tyr-104 or Pro-106 and to 
their use in vaccines which are useful for the prevention or 
treatment of cholera or enterotoxigenic E.Coli infections. 
The proteins can be suitably produced using recombinant DNA 

15 techniques by site-directed mutagenesis of DNA encoding the 
wild type toxins. 

BapfegroTmti the Enysntifln 

20 Cholera is a contagious disease widely distributed in the 
world, in particular in the Third World, where, in certain 
areas, it is endemic. The serious disorders which develop in 
the Intestinal system prove fatal in a high percentage of 
the recorded cases of the disease . 

25 

The etiological agent of cholera is the Gram-negative 
microorganism Vibrio cholerae (V. choleras) . This colonises 
the intestinal tract of individuals who have come into 
contact with it through ingestion of contaminated food or 

30 water, and multiplies to very high concentrations. The 
principal symptom is severe diarrhoea as a result of which 
the patient can lose as much as 10*15 litres of liquids per 
day via the faeces. As a result of the severe dehydration 
and loss of electrolytes, the patient does not withstand the 

35 infection in 50-60% of oases, and dies. The diarrhoea caused 
by V. cholerae is due to the secretion of cholera toxin, CT, 
which acts by stimulating the activity of the adenylate 
cyclase enzyme so as to induce disturbances at cell level. 
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Alth ugh cholera can be effectively cur d by controlled and 
intense rehydrati n, the distribution of a vaccine is 
desirable vith a view to complete control and futur 
eradication of the disease* 

5 

At the present time, there exists a vaccination against 
cholera, consisting of parenteral administration of killed 
bacteria. Although some countries insist on vaccination 
against the disease, there are serious doubts as to its real 
10 usefulness, given that the current cellular vaccine protects 
against the consequences of the infection in only 50* of the 
cases and that the protection is also extremely limited in 
duration, to less than 6 months. 

15 in Bangladesh, an experimental trial is in progress (1990- 
92) of an oral vaccine consisting of killed bacteria vith 
the addition of subunit B of cholera toxin, which is known 
to be highly immunogenic. This product succeeds in inducing 
lasting protection, without special side effects (Holmgren 

20 J., Clemens J. , Sack DA* , Sanchez J. and Svennerholm AM; 
ttQral Immunization against cholera" Curr. Top. Microbiol. 
Immunol. (1988), 146, 197-204). 

Cholera toxin resembles the heat labile toxins of 
25 enterotoxigenic strains of Escherichia coli in amino acid 
sequence, structure and mode of action. 

The consequences of infection with an enterotoxigenic strain 
of B.coli are similar to, though less serious than, those of 
30 cholera, and consist of severe diarrhoea and intestinal 
disorders. 

The CP and LT toxins all comprise a single A subunit (or 
protomer A) responsible for the enzymic activity of the 
35 toxin (herein CT-A or LT-A) and five identical B subunits 
{or protomer B) which are involved in the binding of the 
toxin to the intestinal epithelial cells (herein CT-B or LT- 
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The A subunit penetrates the cell membran and causes 
activation of adenylate cyclase by NAD-dep ndent ADP- 
ribosylation of a GTP-binding protein which controls the 
5 activity of the enzyme. The clinical effect of this is to 
cause massive fluid loss into the intestine. 

Considerable research has been conducted on cholera toxin 
and the S. coll heat labile toxins. 

10 

The sequence of CT is known and has been described 
(Melcalanos J.J, et al Nature 2M, page 551 (1*83)). 

The sequence of LT from enterotoxigenic strains of E.coli 
15 is, as mentioned, 80% homologous to CT and it too is known 
and described in the scientific literature. Spicer B.X. et 
al (Biol. Chem. 251 p. 5716-5721 (1982)) describe the amino 
acid sequence of the A sub unit of the heat labile toxin 
from an enterotoxigenic strain of E. coli found in pigs. 

20 

A bacterial chromosomal form of LT has been identified and 
sequenced by Pickett C.L. et al (J. Bacterid. 5180- 
5187, (1987). 

25 The sequence of the A subunit of LT from a strain of E. coli 
known to affect humans has also been sequenced (Yamamoto et 
al, J. Biol. Chem., 252/ 5037-5044, (1984)). 

In view of the potential clinical significance of a vaccine 
30 against cholera and enterotoxigenic bacteria there is a 
continuing and great interest in producing a detoxified 
toxin capable of immunising against cholera and 
enterotoxigenic bacteria. The techniques of genetic 
engineering allow specific mutations to be introduced into 
35 the genes encoding the toxins and the production of the 
mutated toxins using now conventional techniques of gene 
expression and protein purification. 
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Various groups hav attempted to identify mit&ti ns of the 
genes, wbich involve loss of the toxicity characteristics of 
the encoded prot ins. The studies are predominantly being 
carried out in respect of the gene for the toxin LT, from X. 
5 coll. I 

Harford, s. et al (Eur. J. Biochem. 132, page 311 (1989)) 
describe the production of a toxoid by in vitro mutagenesis 
of the LT-A gene from E.coli pathogenic for pigs. The 

10 resulting successful mutation contained a ser-61-Phe 
substitution and a Gly-79-Lys substitution, the former being 
considered the more important. Harford et al suggest that, 
because of the similarities between the LT-A genes in S.coli 
pathogenic to humans and pigs and the CT-A gene, and because 

15 the toxins are thought to operate by a common mechanism, it 
may be possible to produce a cholera holotoxoid by 
introducing the Ser-61-Ehe mutation into the CT-A gene. 

Tsuji, T. et al (J. Biol. Chem. 2££, p. 22520 (1990)) 
20 describe the mutation of the LT-A gene from plasmid EWD299 
to produce a single substitution Glu-112-Lys which affects 
the toxicity of the mutant LT yet does not change the 
inMinogenicity of the protein. 

25 Grant, C.C.B. et al (Abstract B289 of the 92nd General 
meeting of the American Society for Microbiology, 2 6-3 Oth. 
may 1992) describe conservative substitutions of histidines 
at 44 and 70 and tryptophan at 127 in LT-A which result in 
significant reductions in enzymic activity. 

30 

Some work has been conducted on mutations to CT. 

Kaslow, H.R. et al (Abstract B291 of the 92nd General 
Meeting of the American Society for Microbiology, 26-30th 
35 May 1992) describe mutating Asp-9 and His-44 and truncating 
after «tnn acid 18 o in CT-A which all essentially eliminate 
activity, nutating Arg-9 is said to markedly attenuate 
activity. Mutating other amino acid sites had little ffect 
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on toxicity. 

Burnette, N.N. t ai (Inf. and Immun. £2HH, 4266-4270, 
(1991)) describe site-specific mutagen sis of CT-A to 
5 produce an Arg-7-Lys mutation paralleling that of a known 
detoxifying mutation in the A subunit of the Bordetella 
pertussis toxin. The mutation resulted in the complete 
abolition of detectable ADP-ribosyltransferase activity. 

10 International patent application WO 92/19265 (Bumette, 
Kaslov and Amgen Inc.) describes nutations of CT-A at Arg- 
7, Asp-9, Arg-ll, His-44, His-70 and Glu-112. 

Mutations at Glu-110 (LT and CT) and Arg-146 (LT) have also 
15 been described in the literature (Lobet, Inf. Immun., 2870, 
1991; Lai, Biochem. Biophys. Res. Comm. 341 1983; Okamoto J. 
Bacteriol. 2208, 1988). 

The crystal structure of LT has been determined by Sixma et 
20 al (Nature, 351, 371-377, May 1991) and confirms the 
mutatagenesis results described earlier in the literature, 
explaining structurally the significance of Glu-112 and Ser- 
61 in activity of the A sub unit and suggesting that His- 
44, Ser-114 and Arg-54 which are in the immediate 
25 neighbourhood may be important for catalysis or recognition. 

fimmarv of the invention 

It has now been discovered by further and more detailed 
30 analysis of the structure of the toxins that certain further 
amino acids in the sequences of CT-A and LT-A are in 
positions capable of decreasing the enzymatic activity of 
CT and LT when mutated suitably, individually or in 
conjunction with other mutations. 
35 * 

The object of the present invention is to provide a vaccine 
which gives total protection against Cholera or 
enterotoxigenic £. coll, by m ans of a second generation 
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product consisting of a single antigen, a toxoid derived 
from CT ar IT, which has been detoxified genetically. 

The genetic detoxification of CT or LT retains the 
5 immunogenic properties of the toxoid whilst providing a 
significantly reduced and preferably absent toxicity. 

According to a first aspect of the invention there is 
provided an immunogenic detoxified protein comprising the 

10 amino acid sequence of subunit A of a cholera toxin (CT-A) 
or a fragment thereof or subunit A of an Escherichia coli 
heat labile toxin (LT-A) or a fragment thereof, wherein one 
or more amino acids at, or in positions corresponding to 
Val-53 , Ser-63 , Val-97, Tyr-104 or Pro-106 are replaced with 

15 another amino acid. 

The replaced amino acids are at locations in the sequences 
of CT-A or an LT-A which are conserved both in the amino 
acid sequence and structurally and are thus common to CT 
20 and the various LTs. 

The immunogenic detoxified protein of the invention adopts 
substantially the same structural conformation as the wild 
type naturally occuring toxins. It is immunologically 
25 active and cross reacts with antibodies to the wild type 
toxins. 

In this specification, references to CT and LT encompass the 
various naturally occurring strain variants as- well other 
30 variants encompassing changes from the sequences disclosed 
herein which do not affect the immunogenicity of the 
assembled toxoid. 

In this specification, references to amino acid coordinates 
35 such as D Val-97 a connote the amino acid at that position in 
the sequence of the mature cholera toxin subunit A (CT-A) , 
that is without the signal sequence (see Figure 1) . 
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Where th ep cif icatton refers to an LT-A, the amino acid 
coordinates refer to the c rresponding position in CT-A as 
shown in Figure 1. 

5 Thus, for example, val<-53 in CT corresponds to Val-52 in the 
LTl subunit and ser-63 in CT corresponds to Ser-62 in LTl, 
there being a single amino acid difference in mothering up 
to amino acid 89 of the LTl sequence. VaX-97 in the CT 
sequence corresponds to Val-93 in the LTl sequence because 
10 of the four amino acid difference at that point in the 
sequence * 

In addition, the immunogenic detoxified protein of the 
invention may include other mutations such as, for example, 
15 substitutions at one or more of Arg-7, Asp-9, Arg-11, His- 
44, Arg-54, Ser-61, His-70, His-107, Glu-110, Glu-112, Ser- 
114/ Trp-127, Arg-146 or Arg-192. 

The amino acid substituted for the vild type amino acid may 
20 be a naturally occurring amino acid or may a modified or 
synthetic amino acid. The substitution may involve deletion 
of an amino acid altogether provided that the mutant retains 
the necessary immunogenic properties and exhibits a 
substantially reduced toxicity. 

25 

Substitutions vhich alter the amphotericity and 
hydrophilicity vhilst retaining the steric effect of the 
substituting amino acid as far as possible are generally 
preferred. 

30 

Preferred substitutions include; Val-53-Asp, Val-53-Glu, 
Val-53-Tyr, Ser-63-Lys, Val-97-Lys, Val-97-Tyr, His-107-Glu, 
Tyr-104-Lya, Tyr-104-Asp, Tyr-104-Ser, Pro-106-Ser, Ser-114-. 
Clu,. Ser-114 -Lys. 

35 

As used herein, the term "detoxified" means that the 
Immunogenic composition exhibits a substantially lover 
toxicity relative to its naturally occurring toxin 
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counterpart. The substantially lower toxicity should be 
sufficiently low for the protein to b us d in an 
immun genie composition in an immunologically off ctive 
amount as a vaccine with causing significant side effects. 
5 For example, the immunogenic detoxified protein should have 
a toxicity of leas than 0.01% of the naturally occurring 
toxin counterpart. The toxicity nay be measured in mouse 
CHD cells or preferably by evaluation of the morphological 
changes induced in Yl cells. The term "toxoid" means a 
10 genetically detoxified toxin. 

The immunogenic protein may be a CT or LT subunit A toxoid, 
but is preferably an assembled toxin molecule comprising a 
mutated CI-A or LT-A subunit and five B subunits of CT or 
15 LT* The B subunit may be a naturally occurring subunit or 
'may itself be mutated. 

The immunogenic protein is preferably a naturally occurring 
CT-A or an LT-A suitably modified as described above. 

20 However, conservative amino acid changes may be made which 
do not affect the immunogenicity or the toxicity of 
immunogenic protein and preferably do not affect the ability 
of the immunogenic protein to form complete toxin with P 
subunit protein. Also, the immunogenic protein may be a 

25 fragment of CT-A or an LT-A provided that the fragment is 
immunogenic and non toxic and contains at least one of the 
conserved regions containing one of the mutations according 
to the invention. 

30 According to a second aspect of the invention, there is 
provided an immunogenic composition for use as a vaccine 
comprising an immunogenic detoxified protein of the first 
aspect of the invention and a pharmaceutical^ acceptable 
carrier. 

35 

The immunogenic composition may additionally contain one or 
more adjuvants and/or pharmaceutical^ acceptable diluents. 
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The invention also provides a vaccine composition comprising 
n immunogenic detoxifi d protein according to the first 
aspect of the inventi n and a pharmaceutical^ acceptable 
carrier. The vaccine composition may further comprise an 
5 adjuvant* 

According to a third aspect of the invention, there is 
provided a method of vaccinating a mammal against Vibrio 
choleras or an enterotoxigenic strain of Escherichia coli 
10 comprising administering an immunologically effective amount 
of an immunogenic detoxified protein according to the first 
aspect of the invention. 

The immunogenic detoxified proteins of the invention may be 
15 synthesised chemically using conventional peptide synthesis 
techniques r but are preferably produced by recombinant DNA 
means. 

According to a fourth aspect of the invention there is 
20 provided a DNA sequence encoding an immunogenic detoxified 
protein according to the first aspect of the invention. 

Preferably the DNA sequence contains a DNA sequence encoding 
a complete CT or LT comprising DNA encoding both the 
25 detoxified subunit A and subunit B in a polycistronic unit. 
Alternatively, the DNA may encode only the detoxified 
. subunit A. 

According to a fifth aspect of the invention, there is 
30 provided a vector carrying a DNA according to the fourth 
aspect of the invention. 

According to a sixth aspect of the invention, there is 
provided a host cell line transformed with .the vector 
35 Recording to the fifth aspect of the invention. 

The host cell may be any host capable of producing CT or LT 
but is preferably a bacterium, aost suitably B.coli or 
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V.chaleirae suitable engineered to produce the desired 
immunogenic detoxified protein. 

in a further embodiment of the sixth aspect of the 
5 invention, the host cell may itself provide a protective 
species, for example an E.coli or V.cholaraa strain mutated 
to a phenotype lacking wild type LT or CI and carrying and 
expressing an immunogenic detoxified protein of the first 
aspect of the invention. 

10 

In a further embodiment of the sixth aspect of the invention 
the host cell is capable of expressing a chromosomal LSP-A 
gene according to the first aspect of the invention. 

15 According to a seventh aspect of the invention, there is 
provided a process for the production of an immunogenic 
detoxified protein according to the first aspect of the 
invention comprising culturing a host cell according to the 
sixth aspect of the invention. 

20 

According to a eighth aspect of the invention there is 
provided a process for the production of DHA according to 
the fourth aspect of the invention comprising the steps of 
subjecting a DHA encoding a CT-A or an LOP-A or a fragment 
25 thereof to site-directed mutagenesis. 

According to a ninth aspect of the invention there is 
provided a process for the formulation of a vaccine 
comprising bringing an immunogenic detoxified protein 
30 according to the first aspect of the invention into 
association with a pharaaceutically acceptable carrier and 
optionally with an adjuvant. 

TntMstTHai Applicability 

35 

The immunogenic detoxified protein of the invention 
constitutes the active component of a vaccine composition 
useful for the pr vention and treatment of cholera 
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infections or infections by enterotoxigenic strains f 
E.coll. The compositi ns ar thus applicable for us in th 
pharmaceutical industry. 

5 Brief Desc ription of the Drawings 

Figure l shows the amino acid sequences of the wild type 
subunit A from: 

10 i) cholera toxin (CT - Hekalanos et al op cit) , 

ii) heat labile toxin from an B.coli strain found in man 
(I/P1_1 - Yamamoto ©t al op cit) 

iii) heat labile toxin from an JS.aoll strain found in pigs 
(LT1- Spicer et al op cit) , and 

15 iv) heat labile toxin from a chromosomal source (LT1_1 - 
Pickett et al op cit) 

The signal sequences are not shown. 

20 In Figure 1, the conventional single letter amino acid code 
is used. The symbol denotes an absent amino acid and 
acts as a typographical spacer to ensure that the sequences 
remain in alignment for ease of comparison. The symbol "- 
■ indicates an amino acid in the sequences of LT1 and LT2 

25 which is identical to the corresponding amino acid in CT. 
The numbers against each line are the amino acid number of 
the first amino acid on that line. 

In Figure 1 the positions of the mutations of the present 
30 invention are shown underlined. 

Figures 2a and 2b are comparisons of the amino acid and DNA 
sequences of the A sub units of LT1 and CT. 

35 Figure 3 is a restriction map of plasmid EWD299 (Dallas et 
al), bearing the LT-A gene. 

patailed Description of Embodiments of the Invention 
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•The practice of th pr sent invention will employ, unless 
otherwise indicated, conventional techniques of molecular 
biology, microbiology, recombinant DNA, and immunology, 
5 which are within the skill of the art. Such techniques are 
explained fully in the literature. £fi& e.g., Sambrook, et 
al. , MOLECULAR CLONING; A LABORATORY MANUAL, SECOND EDITION 
(1989); DNA CLONING, VOLUMES I AND II (D.N Glover ed. 
1985)7 OLIGONUCLEOTIDE SYNTHESIS (M.J. Gait ed, 1984); 

10 NUCLEIC ACID HYBRIDIZATION (B.D. Hames & S.J. Higgins edS. 
1984); TRANSCRIPTION AND TRANSLATION (B.D. Haxes & S.J. 
Higgins eds. 1984); ANIMAL CELL CULTURE (R.I. Preshney ed. 
198$) ; IMMOBILIZED CELLS AND ENZYMES (IRL Frees, 1986) ; B. 
Perbal, A PRACTICAL GUIDE TO MOLECULAR CLONING (1984); the 

15 series, METHODS IN ENZYMOLOGY (Academic Press, me); GENE 
TRANSFER VECTORS FOR MAMMALIAN CELLS (J.H. Killer and M.P. 
Calos eds. 1387, cold Spring Harbor Laboratory), Methods in 
Bnzymology Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, 
eds. r respectively), Mayer and Walker, eds. (1987), 

20 IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY 
(Academic Press, London), Scopes, (1987), PROTEIN 
PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition 
(Springer-Verlag, N.Y.), and HANDBOOK OF EXPERIMENTAL IM- 
MUNOLOGY, VOLUMES I-IV (D.M. Weir and C. C. Blackwell eds 

25 1986) . 

Standard abbreviations for nucleotides and amino acids are 
used in this specification. All publications .patents, and 
patent applications cited herein are incorporated by 
30 reference. 

In particular, the following amino acid abbreviations are 



used: 



35 Alanine A 

Arginine R 

Asparagine N 

Aspartic Acid D 



Asp 



Arg 
Asn 
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Cysteine 


c 


Cys 




Glycine 


G 


Gly 




Glutamic Acid 


E 


Glu 




Glutamine 


Q 


Gin 


5 


Hietidine 


H 


Hie 




leoleucine 


I 


lie 




Leucine 


L 


Leu 




Lysine 


K 


Lys 




Methionine 


M 


Met 


10 


Phenylalanine 


F 


Phe 




Proline 


P 


Pro 




Serine 


S 


Ser 




Threonine 


T 


Thr 




Tryptophan 


W 


Trp 


15 


Tyrosine 


y 


Tyr 




Valine 


V 


Val 



As mentioned above examples of the immunogenic detoxified 
protein that can be used in the present invention include 
20 polypeptides with minor amino acid variations from the 
natural amino acid sequence of the protein other than at the 
sites of mutation specifically mentioned. 

A signif leant advantage of producing the immunogenic 
25 detoxified protein by recombinant DMA techniques rather than 
by isolating and purifying a protein from natural sources is 
that equivalent quantities of the protein can be produced by 
using less starting material than vould be required for 
Isolating the protein from a natural source, producing the 
30 protein by recombinant techniques also permits the protein 
to be isolated in the absence of some molecules normally 
present in cells. Indeed, protein compositions entirely 
free of any trace of human protein contaminants can readily 
be produced because the only human protein produced by the 
35 recombinant non-human host is -the recombinant protein at 
issue* Potential viral agents from natural sources and 
viral components pathogenic to humans are also avoided* 
Also, genetically d toxif ied toxin ar 1 ss likely to revert 
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to a toxic frost than nor traditional, chemically detoxified 
toxins . 

Pharmaceutically acceptable carriers include any carrier 
5 that does not itself induce the production of antibodies 
harmful to the individual receiving the composition. 
Suitable carriers are typically large, slowly metabolized 
macromolecules such as proteins, polysaccharides, polylactic 
acids, polyglycolic acids, polymeric amino acids, amino acid 
10 copolymers, lipid aggregates (such as oil droplets or 
liposomes) and inactive virus particles. Such carriers are 
well known to those of ordinary skill in the art. 
Additionally, these carriers may function as 
immunostimulating agents (adjuvants). 

15 

Preferred adjuvants to enhance effectiveness of the compo- 
sition include, but are not limited to: aluminum salts 
(alum) such as aluminium hydroxide, aluminium phosphate, 
aluminium sulfate etc., oil emulsion formulations, with or 

20 without other specific immunostimulating agents such as 
muxamyl peptides or bacterial cell wall components, such as 
for example (1) MF59 (Published international patent 
application WO-A-9 0/148 37, containing 5% Sgualene, 0.5% 
Tween© 80, 0.5% Span* 85 (optionally containing various 

25 amounts of MTP-PB (see below), although not required) 
formulated into submicron particles using a microfluidizer 
such as Model HOY microfluidizer (Microfluidics, Newton, MA 
02164), (2) SAP, containing 10% sgualene, 0.4% Tween 80, 5% 
pluronic-blocked polymer L121, and thr-KDP (see below) 

30 either microfluidized into a submicron emulsion or vortexed 
to generate a larger particle size emulsion, and (3) RIBI" 
adjuvant system (HAS) (Sibi Imnunochem, Hamilton, MI) 
containing 2% Sgualene, 0.2% Tween® 80 and one or more 
bacterial cell wall components from the group consisting of 

35 monophosphoryl lipid A (MM.), trehalose dimycolate (TDM) , 
and cell wall skeleton (CHS) preferably MPL+CW6 (Detox"), 
auramyl peptides such as N-acetyl-muramyl-L-threonyl-D- 
isoglutamine (thr-KDP) , N-acetyl-normuramyl-L-alanyl-o-iso- 
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glutamine (nor-MDP) , N-acetylmuramyl-i,-alanyl-D«- 
isoglutaminy l-L-alanine-2- ( l 1 -2 f -dipalmitoyl-sn-glycero-3- 
hydroxyphosphoryloxy) -ethylamine (MTP-PE) etc., and 
cytokines, such as inter leuKins (IL-l, IL-2 etc) macrophage 
5 colony stimulating factor (M-CSF) , tumour necrosis factor 
(THF) etc. Additionally, saponin adjuvants, such as 
Stimulon* (Cambridge Bioscience, Worcester, MA) may be used 
or particles generated therefrom such as ISCOMS 
(immunostimulating complexes) . Furthermore, Complete 
10 Freunds Adjuvant (CFA) and Incomplete Freunds Adjuvant (IFA) 
may be used. Alum and MF59 are preferred . 

The immunogenic compositions (e.g. the antigen, 
pharmaceutical!/ acceptable carrier and adjuvant) typically 
15 will contain diluents, such as water, saline, glycerol, 
ethanol, etc. Additionally, auxiliary substances, such as 
vetting or emulsifying agents, pH buffering substances, and 
the like, may be present in such vehicles. 

* 20 Typically, the immunogenic compositions are prepared as 
injectables, either as liquid solutions or suspensions; 
solid forms suitable for solution in, or suspension in, 
liquid vehicles prior to injection may also be prepared. 
The preparation also may be emulsified or encapsulated in 
25 liposomes for enhanced adjuvant effect as discussed above 
under pharmaceutical^ acceptable carriers* 

Immunogenic compositions used as vaccines comprise an 
immunologically effective amount of the antigenic 

30 polypeptides, as veil as any other of the above-mentioned 
components, as needed. By n immunologically effective 
amount" , it is meant that the administration of that amount 
to an individual, either in a single dose or as part of a 
series, 16 effective for treatment or prevention. This 

35 amount varies depending upon the health and physical 
condition of the individual to be treated, the taxonomic 
group of individual to be treated (e.g., nonhuman primate, 
primate* etc.), the capacity of the individual's Immune 
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system to synthesize antibodies, the degre of protection 
desired, the formulation of the vaccine, the treating 
doctor 1 s assessment of the medical situation, and other rel- 
evant factors. It is expected that the amount will fall in 
5 a relatively broad range that can be determined through 
routine trials* 

The immunogenic compositions are conventionally administered 
parenterally, e.g* by injection either subcutanaously or 

10 intramuscularly. Additional formulations suitable for other 
modes of administration include oral and pulmonary 
formulations, suppositories and transdermal applications. 
Dosage treatment may be a single dose schedule or a multiple 
dose schedule. The vaccine may be administered in conjunc- 

15 tion with other immunoregulatory agents. 

The term "recombinant polynucleotide" as used herein intends 
a polynucleotide of genomic, cDNA, semisynthetic, or 
synthetic origin which, by virtue of its origin or 
20 manipulation: (1) is not associated with all or a portion of 
a polynucleotide with which it is associated in nature, (2) 
is linked to a polynucleotide other than that to which it is 
linked in nature, or (3) does not occur in nature. 

25 The term polynucleotide" as used herein refers to a 
polymeric form of nucleotides of any length, either 
ribonucleotides or deoxyribonucleotides . This term refers 
only to the primary structure of the molecule. Thus, this 
term includes double* and single-stranded DMA and RNA. It 

30 also includes known types of modifications, for example, 
labels which are known in the art, methylation, 11 caps", 
substitution of one or more of the naturally occurring 
nucleotides: with an analog, intemucleotide modifications 
such as, for example, those with uncharged linkages (e.g. , 

35 methyl phosphonates, phosphotriesters, phosphoamidates, 
carbamates, etc.) and with charged linkages (e.g., 
phosphorothioates, phosphorodithioates, etc.), those 
containing pendant moieties, auch as, f r exampl proteins 
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(including for e.g., nucleases, toxins r antibodies, signal 
peptides, poly-L- lysine, etc.), those with intercalators 
(e.g.; acridine, psoralen, etc.), thos containing chelat rs 
(e.g., ©eta Is, radioactive metals, boron, oxidative metals, 
5 etc.), those containing alkylators, those with modified 
linkages (e.g., alpha an oner ic nucleic acids, etc.) / as yell 
as unmodified forms of the polynucleotide* 

A "replicon" is any genetic element, e.g., a plasmid, a 
10 chromosome, a virus, a cosmid, etc. that behaves as em 
autonomous unit of polynucleotide replication within a cell; 
i.e., capable of replication under its own control. This 
may include selectable markers. 

15 A "vector" is a replicon in which another polynucleotide 
segment is attached, so as to bring about the replication 
and/ or expression of the attached segment. 

"Control sequence" refers to polynucleotide sequences which 
20 are necessary to effect the expression of coding sequences 
to which they are ligated. The nhture of such control 
sequences differs depending upon the host organism; in 
prokaryotes, such control sequences generally include 
promoter, ribosomal binding site, and transcription 
25 termination sequence; in eukaryotes, generally/ such control 
sequences include promoters and transcription termination 
sequence. The term "control sequences" is intended to 
include, at a minimum, all components Whose presence is 
necessary for expression , and may also include additional 
30 components whose presence is advantageous, for example, 
leader sequences and fusion partner sequences. 

"Operably linked" refers to a juxtaposition wherein the 
components so described are in a relationship -permitting 
35 them to function in their intended manner. A control 
sequence "operably linked" • to a coding sequence is ligated 
in such a way that expression of the coding sequence is 
achieved under c ndltions compatibl with the control 
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sequences* 

An "pen reading frame" (ORF) ie a region of a 
polynucleotide sequence which encodes a polypeptide; this 
5 region may represent a portion of a coding sequence or a 
total coding sequence. 

A "coding sequence" is a polynucleotide .sequence which is 
translated into a polypeptide, usually via mRNA, when placed 

10 under the control of appropriate regulatory sequences. The 
boundaries of the coding sequence are determined by a 
translation start codon at the 5 '-terminus and a translation 
stop codon at the 3 1 -terminus. A coding sequence can 
include, but is not limited to, cDHA, and recombinant 

15 polynucleotide sequences. 

"PCR" refers to the technique of polymerase chain reaction 
as described in Saiki, et al>, Mature 324:163 (1986); and 
Scharf et al., Science (1986) 233:1076-1078? and U.S. 
20 4 r 683, 195; and U.S. 4,683,202. 

As used herein, x is "heterologous" with respect to y if x 
is not naturally associated with y in the identical manner; 
i.e., x is not associated with y in nature or x is not 
25 associated with y in the same manner as is found in nature. 

"Homology" refers to the degree of similarity between x and 
y. The correspondence between the sequence from one form to 
another can be determined by techniques known in the art. 

30 Por example, they can be determined by a direct comparison 
of the sequence information of the polynucleotide. 
Alternatively, homology can be determined by hybridization 
of the polynucleotides under conditions which form stable 
duplexes between homologous regions (for exaniple, those 

35 which would be used prior to S x digestion) , followed by 
digestion with single-stranded specific nuclease(s), fol- 
lowed by size determination of the digested fragments. 
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As vised herein, th term "polypeptide" r fers to a p lymer 
of amino acids and does not refer t a specific 1 ngth f 
the product; thus, peptides, oligopeptides, and proteins are 
included within the definition of polypeptide. This team 
5 also does not refer to or exclude post expression 
modifications of the polypeptide, for example, 
glycosylations, acetylations, phosphorylations and the like. 
Included within the definition are, for example, 
polypeptides containing one or more analogs of an amino acid 
10 (including, for example, unnatural amino acids, etc.) r 
polypeptides with substituted linkages, as well as other 
modifications known in the art, both naturally occurring and 
non-naturally occurring. 

15 A polypeptide or amino acid sequence "derived from" a 
designated nucleic acid sequence refers to a polypeptide 
having an amino acid sequence identical to that of a 
polypeptide encoded in the sequence, or a portion thereof 
wherein the portion consists of at least 3-5 amino acids, 

20 and more preferably at least 8-10 amino acids, and erven more 
preferably at least 11-15 amino acids, or which is im- 
munologically identifiable with a polypeptide encoded in the 
sequence* This terminology also includes a polypeptide 
expressed froia a designated nucleic acid sequence. 

25 

The protein may be used for producing antibodies, either 
monoclonal or polyclonal, specific to the protein. The 
methods for producing these antibodies are known in the art. 

30 "Recombinant host cells", "host cells," "cells, 11 "cell 
cultures, * and other such terms denote, for example, 
microorganisms , Insect cells, and aanmallan cells, that can 
be, or have been, used as recipients for recombinant vector 
or other transfer DNA, and include the progeny of the 

35 original cell which has been transformed. It is understood 
that the progeny of a single parental cell may not 
necessarily be conplet ly identical in morph logy or In 
gen mic or t tal DNA complement as the original parent, due 
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to natural, accidental, or d liberate mutation. Examples 
for mammalian host cells Include Chines hanst r ovary (CHO) 
and monkey kidney (COS) cells. 

5 Specifically, as used herein, "cell line," refers to a 
population of cells capable of continuous or prolonged 
growth and division in vitro . Often, cell lines are clonal 
populations derived from a single progenitor cell. It is 
further known in the art that spontaneous or induced changes 

10 can occur in karyotype during storage or transfer of such 
clonal populations. Therefore, cells derived from the cell 
line referred to may not be precisely identical to the 
ancestral cells or cultures, and the cell line referred to 
includes such variants. The term "cell lines" also includes 

15 immortalized cells. Preferably, cell lines include 
nonhyhrid cell lines or hybridomas to only two cell types. 

As used herein, the term "microorganism" includes 
prokaryotic and eukaryotic microbial species such as 
20 bacteria and fungi, the latter including yeast and 
filamentous fungi. 

"Transformation" , as used herein, refers to the insertion of 
an exogenous polynucleotide into a host cell, irrespective 
25 of the method used for the insertion, for example, direct 
uptake # transduction, f -mating or electroporation. The 
exogenous polynucleotide may be maintained as a 
non-integrated vector, for example, a plasnid, or 
alternatively, may be integrated into the host genome. 

30 

By "genomic" is meant a collection or library of DNA 
molecules which are derived from restriction fragments that 
have been cloned in vectors. This may include all or part 
of the genetic material of an organism. 

35 

By "oONA" is meant a complementary DNA sequence that 
hybridizes to a complementary strand of DNA. 
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By "purif led" and "isolated" is meant, when referring to a 
polypeptide r nucleotide sequence, that the indicated 
molecule is present in the substantial absenc of other 
biological macromolecules of the same type. The term 
5 "purified" as used herein preferably means at least 75% by 
weight, more preferably at least 85% by weight r more 
preferably still at least 95% by weight, and most preferably 
at least 93% by weight, of biological macromolecules of the 
same type present (but water, buffers, and other small 
10 molecules, especially molecules having a molecular weight of 
less than 1000, can be present), 

once the appropriate coding sequence is isolated, it can be 
expressed in a variety of different expression systems; for 
15 example those used with mammalian cells, baculoviruees r 
bacteria, and yeast. 

i„ Mammalian Systems 

20 Mammalian expression systems are known in the art. A 
mammalian promoter is any DNA sequence capable Df binding 
mammalian RNA polymerase and initiating the downstream (3') 
transcription of a coding sequence (e.g. structural gene) 
into qRNA. a promoter will have a transcription initiating 

25 region, which is usually placed proximal to the 5' end of 
the coding sequence, and a TATA box, usually located 25-3 0 
base pairs (bp) upstream of the transcription initiation 
site. The TATA box is thought to direct RNA polymerase II 
to begin RNA synthesis at the correct site. A mammalian 

30 promoter will also contain an upstream promoter element, 
usually located within 100 to 200 bp upstream of the TATA 
box. An upstream promoter element determines the rate at 
which transcription is initiated and can act in either 
orientation [Sambrook et al. (1989) "Expression of Cloned 

33 Genes in Mammalian Cells." In Molecular Cloning: A 
Laboratory Manual. 2nd ed.1. 

Mammalian viral genes are ften highly xpressed and have a 
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broad host rang ; thererore sequences encoding mammalian 
viral genes provide particularly useful promoter sequences. 
Examples inalud the SV40 early promoter, noose mammary 
tumor virus LTR promoter, adenovirus major late promoter (Ad 
5 MLP) , and herpes simplex virus promoter. In addition, 
sequences derived from non-viral genes, such as the murine 
aetallotaeionein gene, also provide useful promoter 
segjuences. Expression may be either constitutive or 
regulated (inducible), depending on the promoter can be 
10 induced with glucocorticoid in hormone-responsive cells. 

The presence of an enhancer element (enhancer), combined 
vith the promoter elements described above, will usually 
increase expression levels. An enhancer is a regulatory DMA 

15 sequence that can stimulate transcription up to 100 0-fold 
When linked to homologous or heterologous promoters, with 
synthesis beginning at the normal RNA start site. Enhancers 
are also active when they are placed upstream or downstream 
from the transcription initiation site, in either normal or 

20 flipped orientation, or at a distance of more than 1000 
nucleotides from the promoter [Haniatis et el. (1987) 
glance 236:1 237; Alberts et al. (1989) flolecglar glology of 
the Cell . 2nd ed.}. Enhancer elements derived from viruses 
may be particularly useful, because they usually have a 

25 broader host range. Examples include the SV40 early gene 
enhancer [Dijkema et al (1985) Hffi2__L_±:761] and the 
enhancer /promoters derived from the long terminal repeat 
(MR) of the Rous sarcoma Virus [Gorman et al. (1982b) ProC i 
Mati. Acs«y fie*- 79 ; 67771 and from human cytomegalovirus 

30 [Boshart et al. (1985) f-^ii 4i:52iu Additionally, some 
enhancers are regulatable and become active only in the 
presence of an inducer, such as a hormone or metal ion 
[Sassone-corsi and Borelli (1986) Trends Genet. — 2:215; 
Maniatis et al. (1987) Science 236:1237]. 

35 

A UNA molecule may be expressed intracellular ly in mammalian 
cells. A promoter sequence may be directly linked with the 
DNA mol cule, in which cas th first amin acid at the H- 
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terminus f th roc mbinant protein will always be a 
methionine , which is encoded by the ATG start codon. If 
desired, the N-terminus may be cleaved fr m the protein by 
in vitro incubation with cyanogen bromide. 

5 

Alternatively! foreign proteins can also be secreted from 
the cell into the growth nedia by creating chimeric DNA 
molecules that encode a fusion protein comprised of a leader 
sequence fragment that provides for secretion of the foreign 

10 protein in mammalian cells. Preferably, there are 
processing sites encoded between the leader fragment and the 
foreign gene that can be cleaved either In vivo or in vitro . 
The leader sequence fragment usually encodes a signal 
peptide comprised of hydrophobic amino acids which direct 

IS the secretion of the protein from the cell. The adenovirus 
triparite leader is an example of a leader sequence that 
provides for secretion of a foreign protein in mammalian 
cells. 

20 Usually , transcription termination and polyadenylation 
sequences recognized by mammalian cells are regulatory 
regions located 3 V to the translation stop codon and thus, 
together with the promoter elements, flank the coding 
sequence. The 3* terminus or the mature mRNA is formed by 

25 site-specific post-transcriptional cleavage and polya- 
denylation [Birnstiel et al. (1985) cell 41 :349; Proudfoot 
and Whitelaw (1988} "Termination and 3 9 end processing of 
eukaryotic SNA. In Transcription and splicing (ed. B.D. 
Hames and D.M. Glover); Proudfoot (1989) Trends Biochem. 

30 fici. 14 ; 1051 . These sequences direct the transcription of 
an mRNA which can be translated into the polypeptide encoded 
by the DNA. Examples of transcription 

terminater /polyadenylation signals include those derived 
from SV40 [Sambrook et al (1989) "Expression of cloned genes 

35 in cultured mammalian cells." In Molecular Clonings A 
laboratory Manual! . 

Some genes nay be expressed nor efficiently when introns 
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(also called intervening sequence*) core present. Several 
cDNAs, however, nave been efficiently expressed from vectors 
that lack splicing signals (also called splice donor and 
acceptor sites) (sea e.g., Gothing and Sambrook (1981) 
5 Nature 293 :6201 . Introns are intervening noncoding 
sequences within a coding sequence that contain splice donor 
and acceptor sites. They are removed by a process called 
■splicing," following polyadenylation of the primary 
transcript [Nevins (1983) ^ w '- pav - 52:441; Green 

10 (1986) pnir- R«™ cent. 20 : 671? Padgett et al. (1986) tsam^ 
Blochem. 55 ; 1119; Krainer and Kaniatis (1988) -RHA 
splicing." In Transcript *™ and splicing (ed. B.D. Hemes 
and D.H. Glover) ] . 

15 Usually, the above described components, comprising a 
promoter, polyadenylation signal, and transcription 
termination sequence are put together into expression 
constructs. Enhancers , introns with functional splice donor 
and acceptor sites, and leader sequences may also be 

20 included in an expression construct, if desired. Expression 
constructs are often maintained in a replicon, such as an 
extractor omoscm&l element (e.g., plasmids) capable of stable 
maintenance in a host, such as mammalian cells or bacteria. 
Mammalian replication systems include those derived from 

25 animal viruses, which require trans-acting factors to 
replicate. For example, plasmids containing the replication 
systems of papovaviruses, such as SV40 [Gluzman (1981) fisli 
23.: 1751 or polyamavirus, replicate to extremely high copy 
number in the presence of the appropriate viral T antigen. 

30 Additional examples of mammalian replicons include those 
derived from bovine papillomavirus and Epstein-Barr virus. 
Additionally, the replicon may have two replication systems, 
thus allowing it to be maintained, for example, in mammalian 
cells for expression and in a prooaryotic host for cloning 

35 and amplification. Examples of such mammalian-bacteria 
shuttle vectors include pMT2 [Kaufman et al. (1989) Hal*. 
pbii. Biol. 9 t946 and pHEBO [Shindzu et al. (1986) HQL*. 



SUBSTITUTE SHEET 



WO 93/13202 



25 



7EP92/D3016 



The transformation procedure used depends up n the host to 
be transf rmed. Methods for introduction of heterologous 
polynucleotides into mammalian cells are known in the art 
5 and include dextran-mediated transf ection, calcium phosphate 
precipitation, polytarene mediated transf ection, protoplast 
fusion, electroporation, encapsulation of the 
polynucleotide (s) in liposomes, and direct microinjection of 
the DBA into nuclei. 

10 

Mammalian cell lines available as hosts for expression are 
known in the art and include many immortalized cell lines 
available from the American Type Culture Collection (ATCC) , 
including but not limited to, Chinese hamster ovary (CHO) 
15 cells, HeLa cells, baby hamster kidney (BHK) cells, monkey 
kidney cells (COS), human hepatocellular carcinoma cells 
(e.g., Hep 62), and a number of other cell lines. 

ii. Baculovirus Systems 

20 

The polynucleotide encoding the protein can also be inserted 
into a suitable insect expression vector, and is operably 
linked to the control elements within that vector. Vector 
construction employs techniques which are known in the art. 

25 

Generally, the components of the expression system include 
a transfer vector, usually a bacterial plasmid, which 
contains both a fragment of the baculovirus genome, and a 
convenient restriction site for insertion of the 

30 heterologous gene or genes to be expressed; a wild type 
baculovirus with a sequence homologous to the baculovirus- 
specific fragment in the transfer vector (this allows for 
the homologous recombination of the heterologous gene in to 
the baculovirus genome) ; and appropriate insect ,host cells 

35 and growth media. 

. After Inserting the DNA sequence encoding the protein into 
the transfer vector, the vector and the wild type viral 
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genome are transfected into an insect host cell where th 
vector and viral genome are allowed to recombine. Th 
packaged recombinant virus is xpressed and recombinant 
plaques are identified and purified. Materials and methods 
5 for baculovirus / insect cell expression systems are 
commercially available in kit form from, inter alia, 
Itavitrogen, San Diego CA ("MaxBac" kit)* These techniques 
are generally known to those skilled in the art and fully 

described in Summers and Smith, Xex&s ftqrlCTltrWTal 

10 Thm^ri^nt S tefcion Bulletin Wo. 1555 (1987) (hereinafter 
"Summers and Smith n ) . 

Prior to inserting the DNA sequence enooding the protein 
into the baculovirus genome, the above described components, 

15 comprising a promoter, leader (if desired) , coding sequence 
of interest, and transcription termination sequence, are 
usually assembled into an intermediate transplacement 
construct (transfer vector) . This construct may contain a 
single gene and operably linked regulatory elements; 

20 multiple genes, each with its owned set of operably linked 
regulatory elements; or multiple genes, regulated by the 
same sat of regulatory elements. Intermediate 
transplacement constructs are often maintained in a 
replicon, such as an extrachromosomal element (e.g., 

25 plasoids) capable of stable maintenance in a host, such as 
a bacterium* The replicon will have a replication system, 
thus allowing it to be maintained in a suitable host for 
cloning and amplification. 

30 Currently, the most commonly used transfer vector for 
introducing foreign genes into AcNPV is pAc373. Many other 
vectors, known to those of skill in the art, have also been 
designed. These include, for example, pVL985 (which alters 
the polyhedrin start codon from ATG to ATT, .and which 

95 introduces a *»™pt cloning site 32 basepairs downstream from 
the ATT; see Luckow and Summers, virology (1989) 11:31. 

The plasnid usually also contains th polyhedrin 
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polyadenylation signal (Miller et al. (1988) Ann. Rev, 
Microbiol. . 42,: 177) and a pr caryotic aapioillin-r si stance 
(SffiB) gene and origin of replication for 6 locti n and 
propagation in coli . 

5 

Baculovlrus transfer vectors usually contain a baculovirus 
promoter. A baculovirus promoter is any DKA sequence 
capable of binding a baculovirus RNA polymerase and 
initiating the downstream (5 1 to 3 1 ) transcription of a 

10 coding sequence (e.g. structural gene] into mRNA. A 
promoter will have a transcription initiation region which 
is usually placed proximal to the 5 1 end of the coding 
sequence. This transcription initiation region usually 
includes an RNA polymerase binding site and a transcription 

15 initiation site. A baculovirus transfer vector may also 
have a second domain called an enhancer, which, if present, 
is usually distal to the structural gene. Expression may be 
either regulated or constitutive. 

20 Structural genes, abundantly transcribed at late times in a 
viral infection cycle, provide particularly useful promoter 
sequences. Examples include sequences derived from the gene 
encoding the viral polyhedron protein, Friaeen et al., 
(1986) "The Regulation of Baculovirus Gene Expression," in: 

25 X&S MQleCUlar — Biology of Baculoviruaes (ed. Walter 

Doerfler); EPO Publ. Nos. 127 839 and 155 476; and the gene 
encoding the piO protein, Vlak et al., (1988), J T Gen. 
Yirol, £&:763. 

30 DNA encoding suitable signal sequences can be derived from 
genes for secreted insect or baculovirus proteins, such as 
the baculovirus polyhedrin gene (Car bone 11 et al. (1988] 
Gene. 73:409). Alternatively, since the signals for 
mammalian cell posttranslational aodif ioatione , (such as 

35 signal peptide cleavage, proteolytic cleavage, and 
phosphorylation) appear to be recognized by insect cells, 
and the signals required for secretion and nuclear 
accumulation also appear to be conserved between the 
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invertebrate cells and vertebrat c lis, leaders of non- 
insect origin, such as thos derived from genes encoding 
human a-interTeron, Kaeda et al., (1985), Hotars 215*592; 
human gastrin-releasing peptide, Lebacq-Verheyden et al. r 
5 (1988) , Kni^. c*ll. Biol, 4:3129? human IL-2, Smith et al. , 
(1985) Prai- lrafe'i Acad. Soi» TOA, 12:8404; mouse IL-3, 
(Miyajima et al., (1987) Gene 5£:273; and human 
glucocerebrosidase, Martin et al. (1988) J2H&, 1*99, can also 
be used to provide for secretion in insects. 

10 

A recombinant polypeptide or polyprotein may be expressed 
intracellularly or, if it is expressed with the proper 
regulatory sequences, it can be secreted* Good 
intracellular expression of nonfused foreign proteins 

15 usually requires heterologous genes that ideally have a 
short leader sequence containing suitable translation 
initiation signals preceding an ATG start signal. If 
desired, methionine at the N-terminus may be cleaved from 
the mature protein by in vitro incubation with cyanogen 

20 bromide. 

Alternatively, recombinant polyproteins or proteins which 
are not naturally secreted can be secreted from the insect 
cell by creating chimeric DMA molecules that encode a fusion 

25 protein comprised of a leader sequence fragment that 
provides for secretion of the foreign protein in insects. 
The leader sequence fragment usually encodes a signal 
peptide comprised of hydrophobic amino acids which direct 
the translocation of the protein into the endoplasmic 

30 reticulum. 

After insertion of the DKA sequence and/ or the gene encoding 
the expression product precursor of the protein, an insect 
cell host is co-transformed with the heterologous DKA of the 
35 transfer vector and the genomic DMA of wild type baculovirus 
— usually by co-transf ect ion . The promoter and 
transcription termination sequence of the construct will 
usually comprise a 2-5kb s ction of th baculovirus genom * 
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Methods for Introducing heterologous DNA into the d sired 
sit in the baculovirus virus ar known in the art, (See 
Summer s and Smith supra : Ju et al. (1987); Smith et al*, 
Mol. cell. Biol, (1983) 2:2156; and Luckow and Sunmere 
5 (1989)) . For example, the insertion can be into a gene such 
as the polyhedrin gene, by homologous double crossover 
recombination; insertion can also be into a restriction 
enzyme site engineered into the desired baculo virus gene. 
Killer et al. , (1989) , BioeBsays £:91.The UNA sequence, when 
10 cloned in place of the polyhedrin gene in the expression 
vector, is flanked both 5 f and 3 r by polyhedr in-specific 
sequences and ifi positioned downstream of the polyhedrin 
promoter. 

15 The newly formed baculovirus expression vector is 
subsequently packaged into an infectious recombinant 
baculovirus . Homologous recombination occurs at low 
frequency (between about 1% and about 5%) ; thus, the 
majority of the virus produced after cotransf action is still 

20 wild-type virus. Therefore, a method is necessary to 
identify recombinant viruses. An advantage of the 
expression system is a visual screen allowing recombinant 
viruses to be distinguished. The polyhedrin protein, which 
is produced by the native virus, is produced at very high 

25 levels in the nuclei of infected cells at late tines after 
viral infection. Accumulated polyhedrin protein forms 
occlusion bodies that also contain embedded particles* 
These occlusion bodies, up to 15 (m in size, are highly 
retractile, giving them a bright shiny appearance that is 

30 readily visualized under the light microscope. Cells 
infected with recombinant viruses lack occlusion bodies* To 
distinguish recombinant virus from wild-type virus, the 
transaction supernatant is plagued onto a monolayer of 
.insect cells by techniques known to those skilled in the 

35 art. Namely, the plaques are screened under the light 
microscope for the presence (indicative of wild-type virus) 
or absence (indicative of recombinant virus) of occlusion 
bodi s. "Current Protocols in Microbiology" Vol. 2 (Ausubel 
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at al. eds) at 16.8 (Supp. 10, 1990); simmers and Smith, 
pupra ; Killer et al. (1989) . 

Recombinant baculovlrus expression vectors have been 
S developed for infection into several insect cells. For 
example, recombinant baculovlrus es have been developed for, 
Inter alia ! Andes aeavpti , ftutographa Calif prnjoa, flflBto 
utori . m-nsnrfilla ^lanoaaatar. gpodpptera frug j perfla , end 
■Prlnhnnlusia nl fPCT Pub. Ho. WO 89/046699? Carbonell et 
10 al., (1985) J - Virol. 56: 153 i Wright (1986) HfljaiEs 718 f 
Smith et al., (1983) wm . oil. Biol. 2:2156; and see 
generally, Fraser, et al. (1989) Tn Yttyo PftiH ■ flftY. B l o l r 
2S.:225) . 

15 cells and cell culture media are commercially available for 
both direct and fusion expression of heterologous 
polypeptides in a baculovlrus /expression system; cell 
culture technology is generally known to those skilled in 
the art. sea, e.g. - summers and Smith BSEES- 

20 

The modified insect cells may then be grown in an 
appropriate nutrient medium, which allows for stable 
maintenance of the plasmid(s) present in the modified insect 
host. Where the expression product gene is under inducible 

25 control, the host may be grown to high density, and 
expression induced. Alternatively, where expression is 
constitutive, the product will be continuously expressed 
into the medium and the nutrient medium must be continuously 
circulated, while removing the product of interest and 

30 augmenting depleted nutrients. The product may be purified 
by such techniques as chromatography, e.g., HPLC, affinity 
chromatography, ion exchange chromatography, etc.; 
electrophoresis; density gradient centrifugation; solvent 
extraction, or the like. As appropriate, the product may be 

35 further purified, as required, so as to remove substantially 
any insect proteins which are also secreted in the medium or 
result from lysis of insect cells, so as to provide a 
product which is at least substantially fr of host debris, 
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e.g., proteins, lipids and polysaccharides* 

In ord r to obtain protein expression, reo mbinant h st 
cells derived from the trans formants are incubated under 
5 conditions which allow expression of the recombinant protein 
encoding sequence* These conditions will vary, dependent 
upon the host cell selected. However, the conditions are 
readily ascertainable to those of ordinary skill in the art/ 
based upon what is known in the art. 

10 

ill* Bacterial Systems 

Bacterial expression techniques are known in the art. A 
bacterial promoter is any DMA sequence capable of binding 

15 bacterial RNA polymerase and initiating the downstream (3") 
transcription of a coding sequence (e.g. structural gene) 
into mRNA. A promoter will have a transcription initiation 
region which is usually placed proximal to the 5 1 end of the 
coding sequence. This transcription initiation region 

20 usually includes an UNA polymerase binding site and a 
transcription initiation site. A bacterial promoter may 
also have a second domain called an operator, that may 
overlap an adjacent SNA polymerase binding site at which HNA 
synthesis begins. The operator permits negative regulated 

25 (inducible) transcription, as a gene repressor protein may 
bind the operator and thereby inhibit transcription of a 
specific gene. Constitutive expression may occur in the 
absence of negative regulatory elements , such as the 
operator. In addition, positive regulation may be achieved 

30 by a gene activator protein binding sequence, which, if 
present is usually proximal (V) to the RNA polymerase 
binding sequence. An example of a gene activator protein is 
the catabolite activator protein (CAP) , which helps initiate 
transcription of the lao operon in Escherichia , coli (E. 

35 coli) [Raibaud flfc fli. (1984) Annu. Rev. Genet. lft:i73J. 
Regulated expression may therefore be either positive or 
.-negative, thereby either enhancing or reducing 
transcription. 
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Sequences encoding Metabolic pathway enzymes provide 
particularly useful promoter sequences. Examples include 
promoter sequences derived from sugar metabolizing enzymes, 
5 Sttcb as galactose, lactose (las) [Chang fife fil. (1977) Eafelttft 
12ft: 1056], and maltose. Additional examples include 
promoter sequences derived from biosynthetio enzymes such as 
tryptophan rtra) [Goeddel fife fil- (1980) ypc t ftgjgs Rflg. 
fi:4057; Yelverton fife al- (1981) ffUffl, K9ifa RMt &:731; U.S. 

10 Patent No. 4,738,921? BPO Publ. Mos. 036 776 and 121 775]. 
The g-laotamase fblal promoter system [Weissmann (1981) "The 
cloning of interferon and other mistakes. 11 In interferon 3 
(ed» I. Gresser) ] , bacteriophage lambda PL [Shimatake fife al* 
(1981) Nature 292:1281 and T5 [U.S. Patent Ko. 4,689,406] 

15 promoter systems also provide useful promoter sequences. 

In addition, synthetic promoters *fhich do not occur in 
nature also function as bacterial promoters. For example, 
transcription activation sequences of one bacterial or 

20 bacteriophage promoter may be j oined vith the operon 
sequences of another bacterial or bacteriophage promoter, 
creating a synthetic hybrid promoter [U.S. Patent 
iro. 4,551,433]. For example, the jag promoter is a hybrid 
trp-lac promoter comprised of both £rp promoter and las 

25 operon sequences that is regulated by the lap repressor 
[Amann fife al- (1983) Gene 2§:167; de Boer fit al- (1983) 
Proc. yatl - Acad. Sci. ££:2l]* Furthermore, a bacterial 
promoter can include naturally occurring promoters of non- 
bacterial origin that have the ability to bind bacterial RMA 

30 polymerase and initiate transcription . A naturally 
occurring promoter of non-bacterial origin can also be 
coupled with a compatible RN& polymerase to produce high 
levels of expression of sane genes in prokaryotes. The 
bacteriophase T7 RNA polymerase/promoter system is an 

35 example of a coupled promoter system [Studier fife fil. (1986) 
Mol. Biol. 189; 113; Tabor fife fil. (1985) Prog Wtlt AMflt 
sci. 82 :10741. In addition, a hybrid promoter can also be 
comprised of a bacteriophag promoter and an L S&li 
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operator ragi n (EPO Publ. No. 267 851). 

In addition to a functioning promoter sequence, an efficient 
rlbosome binding site is also useful for the expression of 
5 foreign genes in prokaryotes. In £. oo^i . the rlbosome 
binding site is called the Shlne-Dalgarno (ED) sequence and 
Includes an initiation codon (ATG) and a sequence 3-9 
nucleotides in length located 3-n nucleotides upstream of 
the initiation codon [Shine s£ ai. (1975) Nature 221:34]. 

10 The SD sequence is thought to promote binding of mBlTA to the 
rlbosome by the pairing of bases between the SD sequence and 
the 3' and of fi. coli 16S rRNA [Steitz £t al. (1979) 
"Genetic signals and nucleotide sequences in messenger RNA." 
In prolog Icq* RgggjlftUgP pev$lopnent j <Svm fiffl)rftffS*Pff 

15 (ed. R.F. Goldberger) ] . To express eukaryotic gene* and 
prokaryotic genes with weak ribosome-binding site [Sambrook 
et al . (1989) "Expression of cloned genes in Escherichia 
coli , « In Molecular Cloning: A Laboratory Manual! . 

20 A DNA molecule may be expressed intracellular ly. A promoter 
sequence may he directly linked with the DNA molecule, in 
• which case the first amino acid at the N-terainus will 
always be a methionine, which is encoded by the ATG start 
codon- If desired, methionine at the N-terminus may be 

25 cleaved from the protein by la vitro incubation with 
cyanogen bromide or by either In vivo on in vitro incubation 
with a bacterial methionine N -terminal peptidase (EPO Publ. 
No. 219 237). 

30 Fusion proteins provide an alternative to direct expression. 
Usually, a DNA sequence encoding the N-terainal portion of 
an endogenous bacterial protein, or other stable protein, is 
fused to the 5 1 end of heterologous coding sequences. Upon 
expression, this construct vill provide a fusion of the two 

35 amino acid sequences. For example, the bacteriophage lambda 
cell gene can be linked at the 5 1 terminus of a foreign gene 
and expressed in bacteria. The resulting fusion protein 
preferably retains a Site f r a processing enzyme (factor 
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Xa) to cleave the bacteriophage pr tein from the f r ign 
gene [Nagai et aj,. (1984) HatUEfi Jfla:810] . Fusion pr teins 
can also be made with sequences from th lfl£Z [Jia fit fll. 
(1987) Gene ££:197], troE [Allen et ttlr (1987) J. 
5 Biotebhnol. 5:93; Makoff st Al- (1989) J. G*n- MigrpfrlQlt 
135:111, and Chav [EPO Publ. No. 324 647] gene*. The DNA 
sequence at the junction of the two amino acid sequences may 
or may not encode a cleavable site. Another example is a 
ubiquitin fusion protein. Such a fusion protein is made 
10 with the ubiquitin region that preferably retains a site for 
a processing enzyme (e.g. ubiquitin specific processing- 
protease) to cleave the ubiquitin from the foreign protein, 
(through this method, native foreign protein can be isolated 
[Miller et al. {iqaq^ Bio /Technology 7:6987. 

15 

Alternatively/ foreign proteins can also be secreted from 
the cell by creating chimeric DNA molecules that encode a 
fusion protein comprised of a signal peptide sequence 
fragment that provides for secretion of the foreign protein 

20 in bacteria [U.S. Patent No* 4,336,336]. The signal 
sequence fragment usually encodes a signal peptide comprised 
of hydrophobic amino acids which direct the secretion of the 
protein from the cell. The protein is either secreted into 
the growth media (gram-positive bacteria) or into the 

25 periplasmic space, located between the inner and outer 
membrane of the cell (gram-negative bacteria) . Preferably 
there are processing sites, which can be cleaved either in 
vivo or in vitro encoded between the signal peptide fragment 
and the foreign gene. 

30 

DNA encoding suitable signal sequences can be derived from 
genes for 6ecreted bacterial proteins, such as the £. soli 
outer membrane protein gene (flBB&) [Masui fit al. (1983) , in: 
flffl^iMAff tei Manipul ation of Gene Expression; Ghrqyeb fit fll- 
35 (1984) *hro 2:2437] and the £. fiali alkaline phosphatase 
signal sequence (phoAl [Oka fit al- (1985) FTPCi Iftttl. ASflflt 
pgj. £2.: 7212] . As an additional example, the signal 
sequenc f the alpha-amylas gen from various Bacillus 
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strains can be used to se ret heterologous proteins from 
£. subtil is [Palva £fc (1982) Proc. Natl, Acad, fici. USA 

22:5582; EPO Publ. No. 244 042]. 

5 Usually, transcription termination sequences recognized by 
bacteria are regulatory regions located 3' to the 
translation stop codon, and thus together with the promoter 
flank the coding sequence. These sequences direct the 
transcription of an mRNA which can be translated into the 

10 polypeptide encoded by the DNA. Transcription termination 
sequences frequently include DNA sequences of about 50 
nucleotides capable of forming stem loop structures that aid 
in terminating transcription. Examples include 

transcription termination sequences derived from genes with 

15 strong promoters, such as the tro gene in £. coli as well as 
other biosynthetic genes. 

Usually, the above described components, .comprising a 
promoter, signal sequence (if desired) , coding sequence of 

20 interest, and transcription termination sequence, are put 
together into expression constructs. Expression constructs 
are often maintained in a replicon, such as an 
extrachromosomal element (e.g., plasmids) capable of stable 
maintenance in a host, such as bacteria* The replicon will 

25 have a replication system, thus allowing it to be maintained 
in a procaryotic host either for expression or for cloning 
and amplification. In addition, a replicon may be either a 
high or low copy number plasmid. A high copy number plasmid 
will generally have a copy number ranging from about 5 to 

30 about 200, and usually about 10 to about 150. A host 
containing a high copy number plasmid will preferably 
contain at least about 10, and more preferably at least 
about 20 plaemids. Either a high or low copy number vector 
may be selected, depending upon the effect of the vector and 

35 the foreign protein on the host. 

Alternatively, the expression constructs can be integrated 
into th bacterial g nome with an integrating v ctor. 
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Integrating vectors usually contain at least one sequence 
homologous to th bacterial chromosome that allows the 
vector t Integrate. Integrations appear to r suit from 
recombinations between homologous DNA in the vector and the 
5 bacterial chromosome. For example, integrating vectors 
constructed with DNA. from various Bacillus strains integrate 
into the Bacillus chromosome fZPO Publ. Mo. 127 328) • 
Integrating vectors may also be comprised of bacteriophage 
or transposon sequences. 

10 

Usually, extractoromosomal and integrating expression 
constructs may contain selectable markers to allow for the 
selection of bacterial strains that have been transformed. 
Selectable markers can be expressed in the bacterial host 

15 and may include genes which render bacteria resistant to 
drugs such as ampicillin r chloramphenicol, erythromycin, 
kanamycin (neomycin), and tetracycline [Davies s£ al- (1978) 
Anmi, Rev. Microbiol. 32*46?]. Selectable markers may also 
include biosynthetic genes, such as those in the histidine, 

20 tryptophan x and leucine biosynthetic pathways. 

Alternatively, some of the above described components can be 
put together in transformation vectors. Transformation 
vectors are usually comprised of a selectable market that is 
25 either maintained in a replicon or developed into an 
integrating vector, as described above. 

Expression and transformation vectors, either extra- 
chromosomal replicons or integrating vectors r have been 

30 developed for transformation into many bacteria. Tor 
example i expression vectors have been developed fori j.nter 
alia, the following bacteria: Bacillus eubtilic [Palva £t 
(1982) Proc. Watl . Acad. Sci. USA 2*5 5582; EPO Publ. 
Nos. 036 259 and 063 953; PCT Publ. No. WO 84/04541], 

35 Escherichia coli [Shimatake £t al. (1981) ffatura 222:128; 
amann fife sl» (1985) Gene AO: 183; Studier ££ (1986) £u 
f fol. Biol. 189 :113? EPO Publ. Nos* 036 776, 136 829 and 136 

907], Streptococcus cremoris [Powell fit Al. (1988) AhbLl 
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Environ. Microbiol. 54i6S5f; Strcptoc ecus lividans [Powell 

St Al. (1988) AddJL, gllYlrqn, Microbiol. H:655], 

Streptomyoes lividans [U.S. Patent No. 4,745,056]. 

5 Methods of introducing exogenous DNA into bacterial hosts 
are veil -known in the art, and usually include either the 
transformation of bacteria treated with CaCl 2 or other 
agents, such as divalent cations and DMSO. DMA can also be 
introduced into bacterial cells by electroporation. 

10 Transformation procedures usually vary with the bacterial 
species to be transformed, see e.g., [Hasson at al. (1989) 
FEMS Microbiol. Lett. £fl;273; Palva fit Al. (1982) Proe. 
Wa^l. Acad. Scl. PSA 79:3382; EPO Publ. NOS. 036 259 and 063 
953; PCT Publ. No. ho 84/04541, Bacillus], [Miller fit al. 

15 (1988) Proc. Watl. Acad. Sci. £5.: 856; Wang fit Al- (1990) it. 
Bacterlol. 122:949, Campylobacter], [Cohen fit Al. (1973) 
Prpp. Wfttt. ft<?nfl, flcAt £2:2110; Dover fit Al- (1988) Nucleic 
Acids Res. 16:6127; Kushner (1978) "An improved method for 
transformation of Escherichia coll with ColZl-derived 

20 plasmids. In Genetic Engineering: Proceedings of the 
International Symposium on Genetic Engineering (eda. H.W. 
Boyer and S. Nicosia); Mandel et aI. (1970) J. Mol. Biol. 
51:159; Taketo (1988) Biochim. Bioohva. Acta 9J9.:318; 
Escherichia], [Chassy fi£ fil. (1987) FEMS Microbiol. Lett. 

25 Mi 173 Lactobacillus]; [Fiedler fit al. (198B) Anal. Biochem 
170 :3a. Pseudomonas]; [Augustin fit Al. (1990) pems 
Microbiol. Lett. £g:203, Staphylococcus), [Barany fit &!• 

(1980) i BflCterlQl. l£i:698; Barlander (1987) 

"Transformation of Streptococcus lactis by electroporation, 

30 in: streptococcal Genetics (ed. J. Ferretti and R. Curtiss 
III); Perry e£ fll- (1961) Infec. immun. 22:1295; Powell fit 
Al. (1988) AppI. Environ. Microbiol. £1:655; Somkuti fit Al- 

(198?) £Cfi£j 4£fe EVTt Cong, Biotechnology 1:412, 

Streptococcus]. 

35 

iv. Yeast Expression 

Xeast ^expression systems ar als Known t no f ordinary 
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skill in the art* A yeast promoter is any DFA sequence 
capable of binding yeast: UNA polymerase and initiating the 
downstream (3*) transcription of a coding sequence ( .g. 
structural gene) into mRNA. A promoter will have a 
5 transcription initiation region which is usually placed 
proximal to the 5' end of the coding sequence. This 
transcription initiation region usually includes an HNA 
polymerase binding site (the 11 TATA Box") and a transcription 
initiation site. A yeast promoter may also have a second 

10 called an upstream activator sequence (UAS) , which, 

if present, is usually distal to the structural gene. The 
UAS permits regulated (inducible) expression, constitutive 
expression occurs in the absence of a UAS. Regulated 
expression may be either positive or negative, thereby 

15 either enhancing or reducing transcription. 

Yeast is a fermenting organism with an active metabolic 
pathway, therefore sequences encoding enzymes in the 
metabolic pathway provide particularly useful promoter 

20 sequences. Examples include alcohol dehydrogenase (ADH) 
(EPO Publ. No. 2B4 044) , enolase, glucokinase, glucose-6- 
phosphate isomerase , glyceraldehyde-3-phosphate- 
dehydrogenase (GAP or GAPDH) , hexokinase, 
pbosphofractokinase, 3-pbosphoglycerate mutaee, and pyruvate 

25 kinase (PyK) (EPO Publ. No. 329 203). The yeast PHP? gene, 
encoding aoid phosphatase, also provides useful promoter 
sequences [Hyanohara sfc &1- (1983) Proc. Natl. Acad. Scl. 
USA 80:11. 



30 In addition, synthetic promoters which do not occur in 
nature also function as yeast promoters. For example , UAS 
sequences of one yeast promoter may be joined with the 
transcription activation region of another yeast promoter, 
creating a synthetic hybrid promoter • Examples of such 

35 hybrid promoters include the ADH regulatory sequence linked 
to the GAP transcription activation region (U.S. Patent Nos. 
4,876,197 and 4,880,734). Other examples of hybrid 
promoters include promoters which o nsiet of th regulatory 
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sequences of either the ADH2 , GAL4 , GALlp , OR PH03 genes, 
combined with th transcriptional activation regi n of a 
glyo lytic enzyme g ne such as GAP or PyX {EPO Publ. Ho. 164 
556) . Furthermore, a yeast promoter can include naturally 
5 occurring promoters of non-yeast origin that have the 
ability to bind yeast RNA polymerase and initiate 
transcription. Examples of such promoters include, inter 
alia . [Cohen j£ &1* (1980) Proc, Natl. Acad. Soi. PSA 
72:1078; Henikoff sfc g±. (1981) Nature 221:835; Hollenberg 

10 £fc fil. (1981) Curr. Topics Microbiol. Immunol. ££:119; 
Hollenberg et &1. (1979) "The Expression of Bacterial 
Antibiotic Resistance Cenes i the Yeast Saccharomyces 
cerevisiae," in: Plflfrinlfls of Mffflisali EnYironfflffltttl and 
Commercial Importance (eds. K>N> Timmis and A. Puhler) ; 

15 Mercerau-Puigalon e£ ai. (1980) G^ne 11:163; Panthier at al . 

(L980) curr, Genet. 2:109;] ■ 

A DMA molecule may be expressed intracellular ly in yeast. 
A promoter sequence may be directly linXed with the DNA 
20 molecule, in which case the first amino acid at the N- 
terminus of the recombinant protein will always be a 
methionine, which is encoded by the AT6 start codon. If 
desired, methionine at the N-terminus may be cleaved from 
the protein by in vitro incubation with cyanogen bromide. 

25 

Fusion proteins provide an alternative for yeast expression 
systems, as well as in mammalian, baculovirus, and bacterial 
expression systems. Usually, a DNA sequence encoding the N- 
terminal portion of en endogenous yeast protein, or other 

30 stable protein, is fused to the 5 1 end of heterologous 
coding sequences. upon expression, this construct will 
provide a fusion of the two amino acid sequences. Por 
example, the yeast or human superoxide dlsmutase (SOD) gene, 
can be linked at the 5* terminus of a foreign, gene and 

35 expressed in yeast. The DNA sequence at the junction of the 
two amino acid sequences nay or may not encode a cleavable 
site. See e.g., EPO Publ. No. 196 056. Another example is 
a ubiguitin fusion protein* Such a fusion protein is made 
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with tha ubiquitin region that preferably r tains a site far 
a processing enzyme (e.g. ubiquitin- specific processing 
protease) to cleave the ubiquitin from the foreign pr tein. 
Through this method, therefore, native foreign protein can 
5 be isolated (see, e.g., PCT Publ. Ho. WO 88/024066). 

alternatively, foreign proteins can also be secreted from 
the cell into the growth media by creating chimeric DNA 
molecules that encode a fusion protein comprised of a leader 

10 sequence fragment that provide for secretion in yeast of the 
foreign protein. Preferably, there are processing sites 
encoded between the leader fragment and the foreign gene 
that can be cleaved either ift vivo or In Ylfi£. The leader 
sequence fragment usually encodes a signal peptide comprised 

15 of hydrophobic amino acids which direct the secretion of the 
protein from the cell. 

DNA encoding suitable signal sequences can be derived from 
genes for secreted yeast proteins, such as the yeast 
20 invertase gene (EPO Publ. No. 012 873; JPO Publ. No. 
62,096,086) and the A-factor gene (U.S. Patent Ho. 
4,588,684). Alternatively, leaders of non-yeast origin, 
such as an interferon leader, exist that also provide for 
secretion in yeast (EPO Publ. Ho. 060 057) . 

25 

A preferred class of secretion leaders are those that employ 
a fragment of the yeast alpha-factor gene, which contains 
both a "pre" signal sequence, and a "pro" region, the types 
of alpha-factor fragments that can be employed include the 

30 full-length pre-pro alpha factor leader (about B3 amino acid 
residues) as well as truncated alpha-factor leaders (usually 
about 25 to about 50 amino acid residues) (U.S. Patent Hos. 
4,546,083 and 4,870,008; EPO PUbl. NO. 324 274). Additional 
leaders employing an alpha-factor leader fragment that 

35 provides for secretion include hybrid alpha-factor leaders 
made with a presequence of a first yeast, but a pro-region 
from a second yeast alphaf actor. (See e.g., PCT Publ. No. 
WO 89/02463.) 
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Usually, transcription termination sequences recogniz d by 
yeast are r gulatory regions located 3 1 to the translation 
stop codon, and thus together with the promoter flanX the 

5 coding sequence . These sequences direct the transcription 
of an mRNA which can be translated into the polypeptide 
encoded by the DNA. Examples of transcription terminator 
sequence and other yeast-recognized termination sequences , 

. such as those coding for glycolytic enzymes. 

10 

Usually, the above described components, comprising a 
promoter, leader (if desired) , coding sequence of interest, 
and transcription termination sequence, are put together 
into expression constructs. Expression constructs are often 

15 maintained in a replicon, such as an extrachromosomal 
element (e.g., plasmids) capable of stable maintenance in a 
host, such as yeast or bacteria. The replicon may have two 
replication systems, thus allowing it to be maintained, for 
example, in yeast for expression and in a procaryotic host 

20 for cloning and amplification. Examples of such yeast* 
bacteria shuttle vectors include YEp24 [Botstein ££ Al. 
(1979) Gene 8; 17-241. pCl/1 [Brake fi£ si. (1964) Proc. Natl. 
Acad. Sci USA £1:4642-4646], and YRpl7 [Stinchcomb fit al . 
(1982) J. Mol. Biol. 158 :1571* *n addition, a replicon may 

25 be either a high or low copy number plasmid. A high copy 
number plasmid will generally have a copy number ranging 
from about 5 to about 200, and usually about 10 to about 
150. A host containing a high copy number plasmid will 
preferably have at least about: 10, and more preferably at 

30 least about 20. Enter a high or low copy number vector may 
be selected, depending upon the effect of the vector and the 
foreign protein on the host. See e.g. f Brake fit al. , aUfilfi* 

Alternatively, the expression constructs can be integrated 
35 into the yeast genome with an integrating vector. 
Integrating vectors usually contain at least one sequence 
.homologous to a yeast chromosome that allows the vector to 
Integrate, and preferably contain two homol gous sequences 
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flanking the expression construct, integrations appear to 
result from recombinations between homologous DMA in the 
vector and the yeast chromosome [Orr -Weaver e£ aj,. (1983) 
ffft ^nrt g <n unitvai. 101:228-245] . An integrating vector nay 
5 be directed to a specific locus in yeast by selecting the 
appropriate homologous sequence for inclusion in the vector, 
see Orr-Weaver at fil- , S2BE&.- One or more expression 
construct may integrate, possibly affecting levels of 
recombinant protein produced [Rine e£ al. (1983) PrpCi Hatl. 

10 Acad, acl. UBA 80 1 67501 . The chromosomal sequences included 
in the vector can occur either as a single segment in the 
vector, which results in the integration of the entire 
vector, or two segments homologous to adjacent segments in 
the chromosome and f latticing the expression construct in the 

15 vector, which can result in the stable integration of only 
the expression construct. 

Usually, extra chromosomal and integrating expression 
constructs may contain selectable markers to allow for the 

20 selection of yeast strains that have been transformed. 
Selectable markers may include biosynthetic genes that can 
be expressed in the yeast host, such as ADB2 , JSSAr 2*522. /• 
2322,, and ALG7 . and the G418 resistance gene, which confer 
resistance in yeast cells to tunioamycin and C418, 

25 respectively. In addition, a suitable selectable marker may 
also provide yeast with the ability to grow in the presence 
of toxic compounds, such as metal. For example, the 
presence of CTJP1 allows yeast to grow in the presence of 
copper ions [Butt e£ Al. (1987) Tffira-nhiol. Rev. £1:351]- 

30 

Alternatively, some of the above described components can be 
put together into transformation vectors. Transformation 
vectors are usually comprised of a selectable marker that is 
either maintained in a replicon or developed; into an 
35 integrating vector, as described above. 

Expression and transformation vectors, either 
ertrachromosomal replicons or integrating vectors, have b en 
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d veloped for transf rmati n into many yeasts. For example, 
xpression vectors have b n d v loped for, Inter alia , the 
f llowing yeasta: Candida albicans [Kurtz, fit Ai« (1986} Mol. 
Cell. Biol. 6:1421 . Candida maltose [Kunze, fit fil- (1985) 
5 Basio Microbiol. 25x1411. Hansenula polymorph a [Glee son, fit 
fil. (1986) J. Cen. Microbiol. 122:3459; Roggenkanp fit &!• 
(1986) Mol. Gen. Genet. 2fl2:302], Kluyveromyces fragilis 
[Das, e± fil- (1984) J. Bacterid. 12£:1165], Kluyveromyces 
lactis [De Louvencourt fit &1. (1983) J. Bacteriol. 154:737; 

10 Van den Berg fit Al* (1990) Bio /Technology 8:135], Pichia 
guillerimondii [Kunze fit fil. (1985) J, Basic Microbiol. 
22:141], Pichia pastoris [Cregg, at fil- (1985) Mol. Cell. 
Biol. £:3376; U.S. Patent Nos. 4,837,148 and 4,929,555], 
Saccharomyces cerevisiae [Hinnen fit fil- (1978) Proc. watl- 

15 Acad, set, usa 22:1929; Ito fit fil. (1983) j ff gacterApl, 
121:163], Schizosaccharomyces pombe [Beach and Nurse (1981) 
flature 2££:706], and Yarrowia lipolytlca [Davidow, fit al. 
(1985} Cfflffi Senet, lfl:38047l Gaillardin, at fil- (1985) 
Curr. Genet. 10:49] . 

20 

Methods of introducing exogenous DMA into yeast hosts are 
well-known in the art, and usually include either the 
transformation of spheroplasts or of intact yeast cells 
treated with alkali cations. Transformation procedures 

25 usually vary with the yeast species to be transformed. See 
e,g., [Kurtz fit fil- (1986) Mol. Cell. Biol. £:142; Kunze fit 
al. (1985) J. Basic Microbiol. 22:141; Candida]; [Gleeson 
fit fil- (1986) J. Gen. Microbiol. 132 = 3450? Roggenkamp fit fil- 
(1986) Mol« Gen. Genet. 2fl2:302; Hansenula); [Das fit fil* 

30 (1984) J- Bacteriol. 151:1165; De Louvencourt fit fil- (1983) 
j- Bacterioi. 124:1165; van den Berg fit fil. (1990) 
Mfi/lfiSimsalfiSa S.:135; Kluyveromyces]; [Cregg fit fil. (1985) 
Mol. Cell. Biol. 5:3376; Kunze fit fil. (1985) J, BflfiiC 
Microbiol, 25:141; U.S. Patent Nos. 4,837,148 and ,4,929,555; 

35 Pichia]; [Hinnen fit fil- (1978) Proc. Natl. Acad. Scl. USA 
22;1929; Ito fit fil- (1983) J. Bacteriol. 121:163 
Saccharomyces]; [Beach and NUrse (1981) Nature 2££:706; 
SQhiaosaoohar rayc s]; [Davidow fit fil* (1985) Curr. Genet. 
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IP,: 39 ; Gaillardin fit al. (1985) Curr, Genet t lfi:49; 
Yarrowia] . 

5 

A fragment of the gene for LT was extracted fro* plasmid 
EWD299 [Dallas W.S.. Gill D.M. and Falkow S., 1979, J. 
Bacterid., iaa, 850-858 J by digestion with the restriction 
enzymes Smal and EcoRI, and was redoned in the vector 
10 Bluescript KS suitable for producing single strands of DMA 
[Sambrook J., Fritsch E. and Haniatis, T. "Molecular 
Cloning", Cold Spring Harbor]. 

BR313 cells were transformed by the clones thus obtained and 
15 allowed to grow for 14 hours in a culture medium consisting 
of Luria Broth with the addition of 1 re/ml of uridine. 

A series of synthetic oligonucleotides (listed in Table 1 
below), containing the mutation, or the desired bases 
20 instead of the natural ones, and a sequence of 10 bases 
upstream and 10 downstream of the same natation, identical 
to the natural ones, was first of all synthesised chemically 
and then phosphorylated, 1.5 pmol thereof being treated at 
37 "»C with 5 units of kinase. 

25 

After halting the reaction with a 100 mM EDTA solution, the 
oligonucleotides were annealed to the single strand 
containing the LT gene, by heating for 5 minutes at 70-C and 
cooling slowly for about one hour in ice. 

30 

At that stage there was added to this cold solution (25 Ml) 
a solution of free nucleotides, the enzyme DNA ligase and 
the enzyme DNA polymerase, in a final volume of 100 Ml* 

35 The solution thus obtained was kept for five minutes in ice, 
five minutes at ambient temperature and two hours at 37 *C. 

Suitable cells of E. coli were transformed with the reacti n 
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mixture, In acc rdanc with the usual t chniques [Sambrook 
J., Fritsch E. and Haniatia T. "Molecular Cloning" C Id 
Spring Harbor], and the site-directed mutagenesis vas 
checked by sequencing of the clones obtained * 

5 

The Smal-EcoRi fragment containing the various mutations vas 
substituted for the original Smal-EcoRI insert in the 
plasmid EWD299. 

10 The strains which encode the mutated toxin were then grown 
in 10 ml of Luria Broth for 12 hours at 37 # C. 

The cultures were centrif uged and the precipitate containing 
the cells was resuspended in 300 ml of a solution containing 
15 25% of sucrose and 50 mN of Tris buffer at pH8, and the 
mixture was treated for one hour at ambient temperature with 
1 mg/ml of a solution of Polymixin B. 

The presence of the toxoid in the periplasmatic supernatant 
20 liquor was verified by means of Western Blot and its 
toxicity was evaluated by the inducement or lack of 
inducement of morphological changes in Yl cells (see Table 

25 Yl cells are adrenal tumour epithelial cells which become 
markedly more rounded When treated with a solution 
containing CT or LT [Yasanure Y. , Buonassisl v. and Sato G. , 
"Clonal analysis of differentiated function in animal cell . 
cultures 11 , Cancer Res., 1966, 2Jk, 529-535]. The toxicity of 

30 CT and LT is correlated with this morphological transition. 
The periplastic supernatant is diluted with a solution of 
710 medium, horse serum 1.3%, glutamine and gentamycin to 
lesser and lesser concentrations and Yl cells (250000 
cells/ml) are incubated with the resulting solutionis for 48 

35 hours at 37 *C under an atmosphere of C0 2 « The morphology of 
the cells is evaluated. 

m all cases r immunog nicity was shown by correct assembly 
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of the complete toxoid and by cross reaction f the toxoid 
with, antib dy to the wild type LT. 

The results are shown in Table I below* 

In this Table (and in Table II below) the toxicity symbols 
mean as follows: 



10 



15 



20 



25 



30 



35 



toxic after dilution 1:2000 (wild type toxicity) 
toxic up to dilution 1:250 

toxic up to dilution 1:64 

not toxic, even undiluted 

TABLE I 



Ff gwmlft Motion 



1.1 Iff Val- 

1.2 LT Val- 

1.3 LT val- 

1.4 LT Ser- 

1.5 LT Val- 

1.6 LT val- 

1.8 LT His- 

1.9 LT Tyr- 

1.10 LT Tyr- 

1.11 LT Tyr- 

1.12 LT Pro- 



53-Asp 
53-GlU 
53 -Tyr 
63-Lys 
97-Lys 
97-Tyr 

107-GlU 
104-Lys 
104-Asp 
104-Ser 
106-Ser 



ftHcrnTiiiclp nt^A SpmiflTiee T oxicity 

29 1- ACCGGCTTTGATAGATATGAT-3 11 

291-ACCGGCTTTGaAAGATATSAT-3 11 

291-ACCGGCTTTT1CAGATATGAT-3 11 

3 22 -K3TTTCCACTAAGCTTAGTTTG-3 4 2 

4 24 -ATCTTTAATAAGAATGATGTA-444 

424-ATGTTTAATTACAATGATGTA-444 

454 -TACAGCCCTGAGCCATATGAA-4 7 4 V* 

4 45-ATTAGCGTAAAGAGCCCT-462 

445 -ATTAGCGTAGATAGCCCT-4 62 

447 -TAGCGTAAGTAGCCCTCA-464 

453 -ATACAGC AGCCACCCATA-47 0 



Two mutation of serine CSer-114 -Glu: 4 77-GGAGGTGAAGCGTTAGG- 
494 and Ser-114— Lysi 477-GGAGGTTAAAGCGTTAGG-494) were also 
shown to exhibit substantially reduced toxicity. 



comparative Examples 



A LT 
40 B LT 
C LT 
D LT 
B LT 
F LT 
45 6 LT 
B LT 
I LT 
J LT 
X LT 
SO L LT 
H LT 
B LT 
LT 



LT Wild Type 

Arg-210-Acp 

Leu-41-Phe 

His-44-Phe 

Ala-45-Tyr 

Arg-54-Ala 

Arg-54-Lys 

Tyr-59-Met 

Val-60-Gly 

ser-68-Lys 

Ser-68-Pro 

His-70-Pro 

Ala-72-Arg 

Ala-72-His 

Arg-192-Asn 



769 -ATATATCTCAACGAAIATCAA-7 8 9 + 

11 3 -ATATTAATTTCTATGATC-13 0 NA 

12 1-CTTTATGAITTTGCGAGA- 138 NA 

125-ATGATCACTATAGAGGAA-142 HA 

152-GCTTTGTCGOGTATGATG-169 ++ 

15 l-GGCTTTGTCAAGTATGATGAT-17 1 ++ 
167-ATGACGGAATGGTTTCCA-184 , 

169-GAOGGATATGGATCCACTTCT-189 HA 

193 -AGTTTGAGAAAGGCTCACTTA-213 ++ 

193-AGTTTGAGACCAGCTCACTTA-213 HA 

19 9-AGAAGTGCTCCTTTAGCAGGA-219 HA 

205-GCTCACTTAAGGGGACAGTCT-225 ++ 

2 05 -GCTCACTTACATGG ACAGTCT-22 5 4H- 

565-GATTCATCAATTACAATCACA-585 4H- 
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(HA means "not assembled", i, the holot xin h% is n t 
formed at all) 

5 Example 2 - Detoxified CT 

The procedure followed in the case of the gene for the toxin 
CT is analogous to that described above ♦ 

10 A fragment containing the gene for a CT vas anplif ied by 
neans of the polymerase chain reaction (PGR) technique from 
plasmid pCT322 . An alternative and equivalent source of the 
CT gene is plasmid pJH17 (Pearson et al, FHAS USA, 2&/ 
(1982), 2976-2980). 

19 

The following two synthetic primers were used: 

1) GGCAGATTCIASACCTCCTGATGAAATAAA 

20 2) TCAAGTSTGG CG^gQICriT AA!ITTGCC ATACTAATTGCGGCAATCGCAT 

containing respectively an Xbal site and an artificial 
Hindlll site (shown underlined] • 

25 The resulting amplified fragment, Xbal-Hindlll, which has a 
length of 1074 base pairs r contains the codons of the two 
sub-units, A and B, but not the sequence encoding the leader 
peptide of the A sub-unit. This fragment was recloned in 
Bluescrlpt KS vector and was treated in accordance with the 

30 procedure described above for LT, so as to effect the site- 
directed mutagenesis. 
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TABLE II 



10 



15 



20 



feeamnla Mutation nHgomiclqrt ida S«cm&nce TWflfilty 



2*1 CTVal-53-Asp 
2*2 CTSer-63-Lys 

2.3 CTVal-97-Lye 

2.4 CTSer-106-Pro 

ffomparative gxampl* 

A CP Arg-192-Asn 
B CTArg-54-His 



ACGGGATTTGACAGGC ACGA X 
GTTTCCACCAAGATTAGTTTG 
ATGTTTAACAAGAATGATGTA 
GGCATACAGTAGCCATCCAOA 



GAATGCTCCAAACTCATCGAT 
6GATTTGTTCATCAC6A3XSAT 



The following mutations also proved to abolish toxicity:*- 
107 -Asn (TACAGTCCTAACCCAGATGAA) , Glu-110-Ser 
(TCATCC&G&TTCGCAAGAAGT) , Glu-112-Ala ( CAGATGAACAAGCTCTTTCTG) 
and Ser~114-Glu (CAAGAAGTTGAAGCTTTAGGT) • 

It will be understood that the invention is described above 
by way of example only and modifications of detail nay be 
Bade within the scope and spirit of the invention. 
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CLAIMS ; 

1. An immunogenic det xif ied protein comprising th amino 
acid sequence of subunit A of a cholera toxin (CT-A) or a 
f ragment thereof or the aaino acid sequence of subunit a of 
an Escherichia coli heat labile toxin (LT-A) or a fragment 
thereof wherein one or more amino acid? at, or in positions 
corresponding to Val-53, Ser-63, Val-97, Tyr-104 or.Pro-106 
are replaced with another amino acid. 



2. An immunogenic detoxified protein according to claim 
1 wherein additionally one or more amino acids at, or in 
positions corresponding to Arg-7, Asp-9, Arg-ii, His -4 4, 
Arg-54, ser-6l, His-70, His-107 # Glu-110, Glu-112, Ser-114, 

15 Trp-127, Arg-146 or Arg-192 are replaced. 

3. An immunogenic detoxified protein according to claim 
l or 2 comprising one or more of the following amino acid 
replacements Val-53 -Asp, Val-53-Glu, Val-53-Tyr, Ser-63-Lys, 

20 Val-97-Lys, Val-97-Tyr, His-107-Glu, Tyr-104-Lys, Tyr-104- 
Asp, Tyr-104-Ser, Pro-106-Ser, Ser-114-Glu, Ser-114-Lys. 

4 • An immunogenic composition for use a vaccine 
comprising an immunogenic detoxified protein according to 
25 any one of the preceding claims and a pharmaceut i ca 1 1 y 
acceptable carrier. 



5. A vaccine composition comprising an immunogenic 
detoxified protein according to any one of claims l to 3 and 

30 a piiarmaceutically acceptable carrier. 

6. A vaccine composition according to claim 5 further 
comprising an adjuvant. 

* 

35 7. A DNA sequence encoding an immunogenic detoxified 
protein according to any one of clains 1 to 3. 

8. A veot r carrying a OKA according t dais 7. 
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9. A host cell line transformed with tn vector according 
-to claim 8. 

5 10. A process for the production of an immunogenic 
detoxified protein according to any one of claims 1 to 3 
comprising culturing a host cell according to claim 9. 

11. a process for the production of a DSA according to 
10 claim 7 comprising the steps of subjecting a DNA encoding 

a CT-A or an M-A or a fragment thereof to site-directed 
mutagenesis. 

12. A method of vaccinating a mammal against vibrio 
15 coolers* or an enterotoxigenic strain of Escherichia. 

coli comprising administering an immunologically effective 
amount of an immunogenic detoxified protein according to any 
one of claims l to 3. 

20 13. A process for the formulation of a vaccine according 
to claim 5 comprising bringing an immunogenic detoxified 
protein according to any one of claims 1 to 3 into 
association with a pharmaceutically acceptable carrier. 

25 14. A process for the formulation of a vaccine according 
to claim 6 comprising bringing an immunogenic detoxified 
protein according to any one of claims 1 to 3 into 
association with an adjuvant. 
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LT2 1 -.-.FF T R-A L QQ-AYE PI 38 

LT1 1A 1 -G-R R HN 40 

CT ~ 1 NDDKLYRADSRPPDEIXQSGGLMPRCQSEYFDRCTQMKIN 40 



--E V— NT— N TVT--Q 1 — K — GS- 78 

y 79 

y L A — S Y 80 

LYDHARGTQTGFYRHDDGYVST£ISLRSAHLVGQTILSGH 80 



NE — V-P L-D — G R Y-S-N-FA 118 

-LTIYI ... • IS 116 

m y --Y 120 

STYYrmATAPNHFHT^VLGAXSPJIPDEQEVSALGGIP 120 



L 1 SF-A-EGGMQ D — GDLF-G-TV — N — 158 

— 156 

H I--R E R— N E- 160 

YSQIYGWYRVHFGVLDEQLHRNRGYRDRYYflNLDIAPAAD 160 



— q SNFP M — STP — EQ-VPNNKEFK-GV-I 198 

196 

— r D-Q -Q DSS-TOTGD— N 200 

GYGLAGFPPEHRAWREEPWIHHAPPGCGKAPRSSMSNTCD 200 



SA-NV— KYD-MNFKKLL--RLALTFFM — D-F-GVHGE 241 

-E— N-STIY-R D- .' — EV- . IY .R • 240 

EKTQSLGVKFLDEYQSKVKRQIFSGY . QSDID . THNRI . KDEL 240 



Figure 1 

* 
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U AfiTQGO3AakQMTRTACC0TO<n»ACTC^ 

NGDRLYRADSRFFDHIKRF* 20 



HDDKLYRADSRFPDBX KQSO 20 
C3* aftXGAX0B!EAROTIAXRTCGGGCA8AITC» 

T/P Il liTtU^UBTOrcr ag Rff gT ■ • ♦AArOAOTACTTCaA3aaAOaAJlCTC3lAArQJU^ArT3Uir 

SLMFRaQJIBTrDROTftMHIH 39 

CLMFR9Q8BTFDR0ZftNHIH 40 
CX GGXCXXAMCCAAGROQACACttOTOAjra 

jjf cXX XA aG A TCRCQCQAqAOCAACaCAAACCQO CT^^ 

L2J>B RROTQ70FVRYODOYV 59 

LTOHRROrCTOFVRHDDOYV 40 

cx exzsAXGASCRxac&JWMaA&cr^ 
M Tcc& carixaxjgaM iogTroAOAAOTocTc & 

S2SX»SX.RgABlAGQYXLReY 79 

S?8XSLR8ASLV0QXXI»80B 80 
CS TCCACCTOUVTOlOTCTgAaaU^MCCCaCITAOT^ 

XX Tf n < *TTft / ** T, * , **' T »ATATCOTTATftfrC* AAIAMCTTAATQTTAATOAMTA 

8X.TXYXVXA NMFHVKDV 94 



STYYXYVIAIAFHHFtfVHDV 100 
CT TCXACTTATTATASAIATOraAIAGCW 

la AX!IAQOOZ2l!CACAGCCCTCACCGATATOAACA6 

ISVYCPHFYEQEVSALOaiP 114 



LOAY8FBFOESBV8ALO0XF 120 
CX XIAOOGOCAXACAOXCCrCAXCCAOAXOAACAAOAAOXTXCSOeinAOOTOOOA 



Figure 2a 
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7SQI3fOKTtVHFOVIDBJtLB 136 



CI 



ySQIYOWTHVHFOVLDBQLH 140 

Iff CGXRACAGMAAIAIAflAflACCWXAX^ 

RNRBtftDRYYHHLNXAPABl) 156 

KHROYRDKYYSNLDXAPAAB 160 
CS COTAAXAOTQQCTACAGAOATAOAIRXTACaCT^ 

££ CXn?!ACAGASTA6CAG0TTTCCCACC6GA!ZCACX^ 

GyRLAQPPPDHQAWRBEPWI 176 

©TO'LAOrPPBHRAKRBBPWI 180 
CX OGTTAXaaATTOGCA0QTTTCCCTCCGGAO<3^ 

IX CASawrC^CCaCAAOaZTCTOaAGAXXCAS«AOAA 

ffHAPQCCOD8SRtIXaDTClf 196 

BBAFPOCGVAFR8BX8HXCD 200 

CX CAXCAXGaUXSOCCMOXZOXOtX^^ 

LX ORGOAOAOTC^aAWCTOACCfc^^ 

BBTQFLSTIYLRBYggKVKR 216 



BKTQSLOVKPLDEYQSIVJCR 220 



UP CR(lATATTTTCAQACTAICACTCAQAG(/lUtjACAIRT ATft A CAM ATTCGOOATflAATTATQA 
QXFSDYQSBVDXYKRIRDRL * 

QXF5GYQSDIDXBNRXXDBL * 
CX CAAAZATTnCATCCXATCMTCTOAXAXroA 



Figure 2b 
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